Differential expression in SAGE: accounting for normal between-library variation

نویسندگان

  • Keith A. Baggerly
  • Li Deng
  • Jeffrey S. Morris
  • C. Marcelo Aldaz
چکیده

MOTIVATION In contrasting levels of gene expression between groups of SAGE libraries, the libraries within each group are often combined and the counts for the tag of interest summed, and inference is made on the basis of these larger 'pseudolibraries'. While this captures the sampling variability inherent in the procedure, it fails to allow for normal variation in levels of the gene between individuals within the same group, and can consequently overstate the significance of the results. The effect is not slight: between-library variation can be hundreds of times the within-library variation. RESULTS We introduce a beta-binomial sampling model that correctly incorporates both sources of variation. We show how to fit the parameters of this model, and introduce a test statistic for differential expression similar to a two-sample t-test.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical evaluation of SAGE libraries: consequences for experimental design.

Since the introduction of serial analysis of gene expression (SAGE) as a method to quantitatively analyze the differential expression of genes, several statistical tests have been published for the pairwise comparison of SAGE libraries. Testing the difference between the number of specific tags found in two SAGE libraries is hampered by the fact that each SAGE library is only one measurement: t...

متن کامل

An anatomy of normal and malignant gene expression.

A gene's expression pattern provides clues to its role in normal physiology and disease. To provide quantitative expression levels on a genome-wide scale, the Cancer Genome Anatomy Project (CGAP) uses serial analysis of gene expression (SAGE). Over 5 million transcript tags from more than 100 human cell types have been assembled. To enhance the utility of this data, the CGAP SAGE project create...

متن کامل

Serial Analysis of Gene Expression (SAGE) - Sequencing Errors

Serial Analysis of Gene Expression (SAGE) is a technique to study overall gene expression in different (normal or disease) tissues. Results take a form of a so-called SAGE library for each of the tissues studied. A SAGE library is a set of text-strings (typically 10base-pairs long), called tags. A tag is representative for a gene that is active in a particular cell or tissue. From a statistical...

متن کامل

P-84: Evidence for Differential Expression of The Pluripotency Factors c-MYC, KLF4 and LIN28 in Normal Endometrium and in Endometriosis

Background Endometriosis is a common gynecological disease characterized by the presence of endometrial tissue outside the uterine cavity. This disease affects approximately 10% of women in reproductive age and is associated with pelvic pain, dysmenorrhea and infertility. The theory of involvement of stem cells is a considered new hypothesis in etiology of endometriosis.The aim of this study wa...

متن کامل

Comparative Enumeration Gene Expression

This paper is about differential gene expression measured by transcript counting methods such as SAGE or MPSS. It introduces two significance tests for detection of differential expressed tags: frequentist and Bayesian. Under the frequentist view, it is proposed a test that computes the critical level as a function of each tag total frequency. Under the Bayesian view the Full Bayesian Significa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 19 12  شماره 

صفحات  -

تاریخ انتشار 2003